Recurrent Convolutional Neural Network


Speech Separation Using an Asynchronous Fully Recurrent Convolutional Neural Network

Neural Information Processing Systems

Recent advances in the design of neural network architectures, in particular those specialized in modeling sequences, have provided significant improvements in speech separation performance. In this work, we propose to use a bio-inspired architecture called Fully Recurrent Convolutional Neural Network (FRCNN) to solve the separation task. This model contains bottom-up, top-down and lateral connections to fuse information processed at various time scales, represented by stages. In contrast to the traditional approach of updating all stages in parallel, we propose to first update the stages one by one in the bottom-up direction, then fuse information from adjacent stages simultaneously, and finally fuse information from all stages into the bottom stage together. Experiments showed that this asynchronous updating scheme achieved significantly better results with far fewer parameters than the traditional synchronous updating scheme on speech separation. In addition, the proposed model achieved competitive or better results with high efficiency compared to other state-of-the-art approaches on two benchmark datasets.
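
The asynchronous order can be sketched in a few lines. The following PyTorch-style sketch is illustrative only, assuming 1-D convolutional stages, additive fusion, and nearest-neighbor resampling; the module names and channel counts are our assumptions, not the authors' implementation.

import torch
import torch.nn as nn

class AsyncFRCNNBlock(nn.Module):
    # Illustrative sketch of the asynchronous stage-updating order (not the authors' code).
    def __init__(self, channels=64, num_stages=4):
        super().__init__()
        self.num_stages = num_stages
        self.lateral = nn.ModuleList([nn.Conv1d(channels, channels, 3, padding=1)
                                      for _ in range(num_stages)])
        self.down = nn.ModuleList([nn.Conv1d(channels, channels, 3, stride=2, padding=1)
                                   for _ in range(num_stages - 1)])
        self.fuse_all = nn.Conv1d(channels * num_stages, channels, 1)

    def forward(self, x):                      # x: (B, channels, T)
        # 1) Bottom-up: update stages one by one, each fed by the stage below.
        stages = [self.lateral[0](x)]
        for i in range(1, self.num_stages):
            stages.append(self.lateral[i](self.down[i - 1](stages[-1])))
        # 2) Fuse adjacent stages simultaneously (only the top-down neighbor here, for brevity).
        fused = []
        for i, s in enumerate(stages):
            if i + 1 < self.num_stages:
                s = s + nn.functional.interpolate(stages[i + 1], size=s.shape[-1])
            fused.append(s)
        # 3) Fuse all stages together into the bottom (finest time-scale) stage.
        T = fused[0].shape[-1]
        return self.fuse_all(torch.cat(
            [nn.functional.interpolate(f, size=T) for f in fused], dim=1))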


All Eyes, no IMU: Learning Flight Attitude from Vision Alone

Hagenaars, Jesse J., Stroobants, Stein, Bohte, Sander M., De Croon, Guido C. H. E.

arXiv.org Artificial Intelligence

Vision is an essential part of attitude control for many flying animals, some of which have no dedicated sense of gravity. Flying robots, on the other hand, typically depend heavily on accelerometers and gyroscopes for attitude stabilization. In this work, we present the first vision-only approach to flight control for use in generic environments. We show that a quadrotor drone equipped with a downward-facing event camera can estimate its attitude and rotation rate from just the event stream, enabling flight control without inertial sensors. Our approach uses a small recurrent convolutional neural network trained through supervised learning. Real-world flight tests demonstrate that our combination of event camera and low-latency neural network is capable of replacing the inertial measurement unit in a traditional flight control loop. Furthermore, we investigate the network's generalization across different environments, and the impact of memory and different fields of view. While networks with memory and access to horizon-like visual cues achieve the best performance, variants with a narrower field of view achieve better relative generalization. Our work showcases vision-only flight control as a promising candidate for enabling autonomous, insect-scale flying robots.
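
As a rough illustration of such a network, the sketch below maps a sequence of two-channel event frames to per-step attitude and rate estimates through a small convolutional encoder and a GRU; all layer sizes and the four-dimensional output are assumptions, not the paper's architecture.

import torch
import torch.nn as nn

class EventAttitudeNet(nn.Module):
    # Illustrative recurrent CNN for attitude estimation from event frames (assumed shapes).
    def __init__(self, hidden=64):
        super().__init__()
        self.encoder = nn.Sequential(                      # 2 polarity channels per event frame
            nn.Conv2d(2, 16, 5, stride=2, padding=2), nn.ReLU(),
            nn.Conv2d(16, 32, 3, stride=2, padding=1), nn.ReLU(),
            nn.AdaptiveAvgPool2d(4), nn.Flatten())
        self.gru = nn.GRU(32 * 4 * 4, hidden, batch_first=True)  # memory over the event stream
        self.head = nn.Linear(hidden, 4)                   # e.g. roll, pitch and two rates

    def forward(self, frames):                             # frames: (B, T, 2, H, W)
        B, T = frames.shape[:2]
        feats = self.encoder(frames.flatten(0, 1)).view(B, T, -1)
        out, _ = self.gru(feats)
        return self.head(out)                              # per-step estimates: (B, T, 4)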


Brief Review -- RCNN: Recurrent Convolutional Neural Network for Object Recognition

#artificialintelligence

A similar idea is used in the PolyInception modules of PolyNet, which was the 2nd runner-up in the ILSVRC 2016 image classification task.


LIMSI_UPV at SemEval-2020 Task 9: Recurrent Convolutional Neural Network for Code-mixed Sentiment Analysis

Banerjee, Somnath, Ghannay, Sahar, Rosset, Sophie, Vilnat, Anne, Rosso, Paolo

arXiv.org Artificial Intelligence

This paper describes the participation of the LIMSI UPV team in SemEval-2020 Task 9: Sentiment Analysis for Code-Mixed Social Media Text. The proposed approach competed in the SentiMix Hindi-English subtask, which addresses the problem of predicting the sentiment of a given Hindi-English code-mixed tweet. We propose a recurrent convolutional neural network that combines a recurrent neural network and a convolutional network to better capture the semantics of the text for code-mixed sentiment analysis. The proposed system obtained an F1 score of 0.69 (best run) on the given test data and achieved 9th place (Codalab username: somban) in the SentiMix Hindi-English subtask.
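
A minimal sketch of this kind of recurrent convolutional text classifier is given below, with a bidirectional LSTM producing contextual features and a convolution extracting local n-gram features before max-pooling; the vocabulary size, dimensions, and three-class head are assumptions, not the submitted system.

import torch
import torch.nn as nn

class RCNNSentiment(nn.Module):
    # Illustrative recurrent convolutional text classifier (not the team's system).
    def __init__(self, vocab_size=30000, emb=100, hidden=128, classes=3):
        super().__init__()
        self.embed = nn.Embedding(vocab_size, emb, padding_idx=0)
        self.rnn = nn.LSTM(emb, hidden, batch_first=True, bidirectional=True)
        self.conv = nn.Conv1d(2 * hidden, hidden, kernel_size=3, padding=1)
        self.cls = nn.Linear(hidden, classes)          # positive / negative / neutral

    def forward(self, tokens):                         # tokens: (B, T) integer ids
        h, _ = self.rnn(self.embed(tokens))            # contextual features: (B, T, 2*hidden)
        c = torch.relu(self.conv(h.transpose(1, 2)))   # local n-gram features
        return self.cls(c.max(dim=-1).values)          # max-pool over time, then classify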


Recurrent Convolutional Neural Networks help to predict location of Earthquakes

Kail, Roman, Zaytsev, Alexey, Burnaev, Evgeny

arXiv.org Machine Learning

We examine the applicability of modern neural network architectures to the mid-term prediction of earthquakes. Our data-based classification model aims to predict whether an earthquake with a magnitude above a threshold will take place in a given area of size $10 \times 10$ kilometers within $10$-$60$ days of a given moment. Our deep neural network model has a recurrent part (LSTM) that accounts for time dependencies between earthquakes and a convolutional part that accounts for spatial dependencies. The results show that neural-network-based models beat feature-based baseline models that also account for spatio-temporal dependencies between different earthquakes. On historical data of earthquakes in Japan, our model predicts the occurrence of an earthquake with magnitude $M_c > 5$ within $10$ to $60$ days of a given moment with quality metrics ROC AUC $0.975$ and PR AUC $0.0890$, making $1.18 \cdot 10^3$ correct predictions while missing $2.09 \cdot 10^3$ earthquakes and raising $192 \cdot 10^3$ false alarms. The baseline approach has a similar ROC AUC of $0.992$, a similar number of correct predictions ($1.19 \cdot 10^3$), and misses $2.07 \cdot 10^3$ earthquakes, but has a significantly worse PR AUC of $0.00911$ and far more false alarms ($1004 \cdot 10^3$).
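
A hedged sketch of such a model: a convolutional encoder summarizes each time step's spatial activity map, and an LSTM accumulates the temporal dependencies before a binary head. The input layout, layer sizes, and single-probability output are our assumptions, not the paper's architecture.

import torch
import torch.nn as nn

class QuakeNet(nn.Module):
    # Illustrative CNN + LSTM over gridded seismicity histories (assumed shapes).
    def __init__(self, hidden=64):
        super().__init__()
        self.spatial = nn.Sequential(              # one activity map per time step
            nn.Conv2d(1, 16, 3, padding=1), nn.ReLU(),
            nn.MaxPool2d(2),
            nn.Conv2d(16, 32, 3, padding=1), nn.ReLU(),
            nn.AdaptiveAvgPool2d(1), nn.Flatten())
        self.temporal = nn.LSTM(32, hidden, batch_first=True)  # time dependencies
        self.head = nn.Linear(hidden, 1)

    def forward(self, maps):                       # maps: (B, T, 1, H, W)
        B, T = maps.shape[:2]
        feats = self.spatial(maps.flatten(0, 1)).view(B, T, -1)
        _, (h, _) = self.temporal(feats)
        return torch.sigmoid(self.head(h[-1]))     # P(quake above threshold in 10-60 days)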


Deep Learning Techniques for Text Classification

#artificialintelligence

Deep learning models have achieved state-of-the-art results across many domains. RMDL (Random Multimodel Deep Learning) addresses the problem of finding the best deep learning structure and architecture while simultaneously improving robustness and accuracy through ensembles of different deep learning architectures. RMDLs can accept a variety of data as input, including text, video, images, and symbols.
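
The core RMDL idea, training several randomly configured models and combining their predictions by majority vote, can be sketched as follows; the random MLP generator and vote function here are simplified illustrations, not the RMDL library's API.

import random
import torch
import torch.nn as nn

def random_mlp(in_dim, classes):
    # Randomly pick depth and width, as RMDL does for its DNN branch.
    layers, width = [], in_dim
    for _ in range(random.randint(1, 3)):
        nxt = random.choice([64, 128, 256])
        layers += [nn.Linear(width, nxt), nn.ReLU()]
        width = nxt
    layers.append(nn.Linear(width, classes))
    return nn.Sequential(*layers)

def majority_vote(models, x):
    # Each (trained) model votes with its argmax class; the mode of the votes wins.
    votes = torch.stack([m(x).argmax(dim=-1) for m in models])
    return votes.mode(dim=0).values

models = [random_mlp(in_dim=300, classes=4) for _ in range(5)]
preds = majority_vote(models, torch.randn(8, 300))  # class votes for 8 examples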


Visual Depth Mapping from Monocular Images using Recurrent Convolutional Neural Networks

Mern, John, Julian, Kyle, Tompa, Rachael E., Kochenderfer, Mykel J.

arXiv.org Artificial Intelligence

A reliable sense-and-avoid system is critical to enabling safe autonomous operation of unmanned aircraft. Existing sense-and-avoid methods often require specialized sensors that are too large or power-intensive for use on small unmanned vehicles. This paper presents a method to estimate object distances from visual image sequences, allowing low-cost, on-board monocular cameras to serve as simple collision avoidance sensors. We present a deep recurrent convolutional neural network and training method to generate depth maps from video sequences. Our network is trained using simulated camera and depth data generated with Microsoft's AirSim simulator. Empirically, we show that our model achieves superior performance compared to models generated using prior methods. We further demonstrate that the method can be used for sense-and-avoid of obstacles in simulation.
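
A minimal sketch of a recurrent convolutional encoder-decoder for this task is shown below: a convolutional recurrent state carries temporal context across frames while a decoder emits one depth map per frame. The simplified recurrent cell and all shapes are assumptions, not the paper's network.

import torch
import torch.nn as nn

class SimpleConvRNNCell(nn.Module):
    # Minimal convolutional recurrent cell: the hidden state keeps its spatial layout.
    def __init__(self, ch):
        super().__init__()
        self.mix = nn.Conv2d(2 * ch, ch, 3, padding=1)

    def forward(self, x, h):
        return torch.tanh(self.mix(torch.cat([x, h], dim=1)))

class DepthRCNN(nn.Module):
    # Illustrative recurrent encoder-decoder; H and W must be divisible by 4.
    def __init__(self, ch=32):
        super().__init__()
        self.ch = ch
        self.enc = nn.Sequential(
            nn.Conv2d(3, ch, 3, stride=2, padding=1), nn.ReLU(),
            nn.Conv2d(ch, ch, 3, stride=2, padding=1), nn.ReLU())
        self.cell = SimpleConvRNNCell(ch)
        self.dec = nn.Sequential(
            nn.ConvTranspose2d(ch, ch, 4, stride=2, padding=1), nn.ReLU(),
            nn.ConvTranspose2d(ch, 1, 4, stride=2, padding=1))

    def forward(self, video):                      # video: (B, T, 3, H, W)
        B, _, _, H, W = video.shape
        h = video.new_zeros(B, self.ch, H // 4, W // 4)
        depths = []
        for t in range(video.shape[1]):            # carry context frame by frame
            h = self.cell(self.enc(video[:, t]), h)
            depths.append(self.dec(h))             # one depth map per frame
        return torch.stack(depths, dim=1)          # (B, T, 1, H, W)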


AI can untangle the jumble of neurons packed in brain scans

#artificialintelligence

AI can help neurologists automatically map the connections between different neurons in brain scans, a tedious task that can otherwise take hundreds or thousands of hours. In a paper published in Nature Methods, AI researchers from Google collaborated with scientists from the Max Planck Institute of Neurobiology to inspect the brain of a zebra finch, a small Australian bird renowned for its singing. Although the contents of their craniums are small, zebra finches aren't birdbrains: their connectome is densely packed with neurons. To map the connections, scientists image slices of the brain using an electron microscope, which requires high resolution to make out all the different neurites, the thin projections extending from nerve cells.

